The present invention relates to a novel polynucleotide involved in heritable Parkinson's
disease (PD), a novel polypeptide encoded by the polynucleotide, and a method for
diagnosing heritable Parkinson's disease.

Background

Parkinsonism (MIM 168600) is a clinical syndrome characterized by bradykinesia,
resting tremor, muscle rigidity, and postural instability (Gelb et al. 1999). The most
common cause of parkinsonism is Parkinson's disease (PD). Second to Alzheimer's
disease, PD is the most common neurodegenerative disorder, affecting >1% of the
population over 55 years of age (de Rijk et al. 1995). Neuropathological findings in PD are
loss of pigmented neurons in the brainstem, substantia nigra and locus ceruleus, with
intracellular Lewy body inclusions found within surviving neurons (Forno 1996).

Although PD is considered a sporadic disease, various hereditary forms of parkinsonism
have been recognized (Vila and Przedborski 2004). A major breakthrough in recent years
has been the mapping and cloning of a number of genes causing monogenic forms of
parkinsonism. Genomic multiplication and missense mutations in the α-synuclein gene
were initially identified in a small number of families with autosomal dominant
parkinsonism (PARK1/4 [MIM 168601]) (Polymeropoulos et al. 1997; Kruger et al. 1998;
Singleton et al. 2003; Chartier-Harlin et al. 2004; Farrer et al. 2004; Zarranz et al. 2004).
Subsequently, α-synuclein antibodies were found to robustly stain Lewy bodies and Lewy
neurites in the substantia nigra in familial and sporadic PD (Spillantini et al. 1997), and
common genetic variability in the α-synuclein promoter has been implicated in sporadic PD
(Pals et al. 2004).

Autosomal recessive mutations in three genes, parkin, DJ-1 and PINK1, have been linked
with early-onset parkinsonism (<45 years at onset) (PARK2, PARK6 & PARK7 [MIM
602533, 602544 & 608309]) (Kitada et al. 1998; Bonifati et al. 2003; Valente et al. 2004). A
large number of pathogenic mutations and rearrangements have been identified in the
parkin gene (reviewed by Mata et al. 2004), but mutations in DJ-1 and PINK1 are rare
(unpublished data).

Very recently, five pathogenic mutations were identified in a gene, leucine-rich repeat
kinase 2 (LRRK2), in six families with autosomal dominant parkinsonism linked to the
PARK8 locus [MIM 607060] (Zimprich et al. 2004a). Paisan-Ruiz and colleagues
independently confirmed these findings, reporting two pathogenic mutations in British and
Basque families (Paisan-Ruiz et al. 2004).

Object

The object of the invention is to isolate a gene or polynucleotide whose presence
demonstrates heritable parkinsonism, and to use the presence of this gene to diagnose a
patient before he or she becomes ill.
A further object is to use this gene or polynucleotide to transfect a microorganism or
experimental animal in order to develop a new medicine for treating parkinsonism, or for
preventing the onset of the illness.

The invention 

Heritable parkinsonism may be demonstrated by the method according to the characterizing
part of claim 5, and the other objects are met by a polynucleotide according to the
characterizing part of claim 1, a recombinant vector according to claim 3, a DNA probe and
a DNA primer according to claims 4 and 6 respectively, and a peptide according to claim 9.

The inventors have isolated a novel LRRK2 mutation, and this mutation may cause
development of dominantly inherited PD. By screening healthy persons, one can determine
whether they carry the mutation and thus will most likely develop the
illness.

As the sequence of the mutated gene is known, microorganisms and experimental
animals may be transfected in order to search for a new medicine to treat or prevent
the onset of the illness.

In the following, the invention will be described by reference to a study of PD patients
and their families. Parts of the study are shown in the figures, wherein

Figure 1 shows a schematic drawing of LRRK2 with predicted protein domains,
Figure 2 shows pedigrees of families with LRRK2 G2019S,
Figure 3 shows chromosome 12q12 STR markers on the disease haplotype (PARK8),
Figure 4 shows the probability of becoming affected by parkinsonism, in LRRK2 G2019S
carriers, as a function of age, and
Figure 5 shows aligned amino acid sequences of the activation loop of different human
kinases.

The inventors identified seven unrelated persons, all having the new mutation, from 248
multiplex kindreds with dominantly inherited PD, and six further unrelated persons from
three population-based series of persons with dominantly inherited PD. These 13 persons
and their families formed the basis for the inventors' further work. Segregation and linkage
analysis provides evidence for pathogenicity and an estimate of age-associated penetrance;
haplotype analysis demonstrates that the mutation originates from a common and ancient
founder.

Subjects and Methods 

Study subjects 

The patients and controls were examined by neurologists specialized in movement
disorders. A full history, including family history and neurological examination, was
completed on each patient. Clinical diagnosis of PD required the presence of at least two of
three cardinal signs (resting tremor, bradykinesia and rigidity), improvement from adequate
dopaminergic therapy and the absence of atypical features or other causes of parkinsonism.

LRRK2 sequencing and mutation screening 

Blood samples were taken and genomic DNA was extracted using standard techniques.
Six families (families 194, 281, 3081, 3082, 3083 and 3211) were known to have a positive
LOD score for STR (short tandem repeat) markers in the PARK8 locus (Zimprich et al.
2004b). All 51 exons of the LRRK2 gene were amplified by polymerase
chain reaction (PCR) in one patient with PD from each of these six families. All PCRs
were carried out for each primer set with 20-50 ng of template DNA in a total volume of
25 µl using a final reaction concentration of 200 µM dNTP, 1x PCR-Buffer (Qiagen),
1x Q-Solution (Qiagen), and 0.8 µM of each primer. One unit of Taq polymerase (Qiagen) was
added to each reaction. Amplification was performed using a 57-52°C touchdown protocol
over 38 cycles. The primers used for PCR amplification of LRRK2 exons and for
sequencing are available on request.
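
As a worked illustration of the reaction set-up described above, the volume of each stock solution needed for one 25 µl reaction follows from C1 x V1 = C2 x V2. The final concentrations are those given in the text; the stock concentrations (10 mM dNTP mix, 10 µM primers, 10x buffer, 5x Q-Solution) are assumptions chosen for illustration only and are not stated in the source.

```python
# Sketch: stock volumes for one 25 ul PCR, from C1 * V1 = C2 * V2.
# Final concentrations come from the text; every stock concentration
# below is an ASSUMPTION made for illustration.

def stock_volume_ul(final_conc, stock_conc, total_volume_ul):
    """Volume of stock (ul) giving final_conc in total_volume_ul."""
    return final_conc * total_volume_ul / stock_conc

TOTAL_UL = 25.0

# name: (final concentration, assumed stock concentration), same unit per pair
components = {
    "dNTP mix": (200.0, 10_000.0),    # uM; assumed 10 mM stock
    "forward primer": (0.8, 10.0),    # uM; assumed 10 uM stock
    "reverse primer": (0.8, 10.0),    # uM; assumed 10 uM stock
    "PCR buffer": (1.0, 10.0),        # x;  assumed 10x stock
    "Q-Solution": (1.0, 5.0),         # x;  assumed 5x stock
}

volumes = {
    name: stock_volume_ul(final, stock, TOTAL_UL)
    for name, (final, stock) in components.items()
}

for name, vol in volumes.items():
    print(f"{name:15s} {vol:5.2f} ul")

# The remainder of the 25 ul is template DNA, polymerase and water.
remaining_ul = TOTAL_UL - sum(volumes.values())
print(f"{'remaining':15s} {remaining_ul:5.2f} ul")
```

With different stock concentrations only the second number in each pair changes; the calculation itself stays the same.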

The nucleotide sequences of all PCR products were determined by direct sequencing.
Each PCR product was cleaned by using a Millipore PCR purification plate. Three
microliters of purified PCR product was used per sequencing reaction with 1 µl of either the
forward or reverse PCR primer and 1 µl of BigDye reaction mix (Applied Biosystems).
Electrophoresis was performed under standard conditions on an ABI 3730 automated
sequencer (Applied Biosystems). All sequences were obtained with both forward and
reverse primers. Sequences were analyzed with SeqScape software version 2.1.1 (Applied
Biosystems) and compared with the published sequence of LRRK2 (GenBank accession no.
AY792511).

After identification of a heterozygous G2019S (G6055A) mutation in the proband of
family 3215 (referred to as family 3211 in Zimprich et al. 2004b), we designed a probe
employing TaqMan chemistry on an ABI 7900 (Applied Biosystems) to screen for this
mutation. First we examined 248 PD patients from families with a known family history
consistent with autosomal dominant transmission of a suspected causative gene. Then 377
Norwegian, 271 Irish and 100 Polish PD patients (constituting the three population series)
were checked using this assay. As control samples, 2260 samples from healthy persons from
similar populations were also included (1200 US American, 550 Norwegian, 330 Irish and 180
Polish subjects). Mutations were confirmed by
direct sequencing of PCR products from LRRK2 exon 41. Finally, all participating family
members of LRRK2 G2019S mutation carriers (affected and unaffected) were screened for
the mutation.

By 6055 G>A or G6055A it is meant that nucleotide number 6055 of the LRRK2 gene,
counted from the 5' end of the polynucleotide, has changed from G (guanine) to A
(adenine). This change also causes a change in the polypeptide encoded by the
polynucleotide, and G2019S denotes a polypeptide in which amino acid number 2019 is
changed from G (glycine) to S (serine). These abbreviations are well known to persons skilled
in the art.
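
The relation between the two notations can be checked with a little arithmetic. The sketch below assumes that position 1 is the first base of the start codon, so that every three bases form one codon; it uses only the standard genetic code, and the helper names are illustrative. It shows that nucleotide 6055 is the first base of codon 2019, and which glycine codons become serine codons when that first base changes from G to A.

```python
# Sketch: map a coding-sequence position to its codon, then list the
# glycine codons that turn into serine codons after a G>A change at the
# first codon position. ASSUMPTION: position 1 is the first base of the
# start codon.

def codon_number(nt_position):
    """1-based codon index for a 1-based coding-sequence position."""
    return (nt_position - 1) // 3 + 1

def position_in_codon(nt_position):
    """1, 2 or 3: where the nucleotide sits inside its codon."""
    return (nt_position - 1) % 3 + 1

# The relevant slice of the standard genetic code.
CODE = {
    "GGA": "Gly", "GGC": "Gly", "GGG": "Gly", "GGT": "Gly",
    "AGA": "Arg", "AGC": "Ser", "AGG": "Arg", "AGT": "Ser",
}

codon = codon_number(6055)
offset = position_in_codon(6055)
print(codon, offset)

gly_to_ser = [
    c for c in sorted(CODE)
    if CODE[c] == "Gly" and CODE["A" + c[1:]] == "Ser"
]
print(gly_to_ser)
```

Only glycine codons ending in C or T give serine after this change; the other two would give arginine.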

Genotyping of STR markers

Fourteen STR markers were genotyped in mutation carriers and all available family
members, in all 13 families, for linkage analyses and to determine whether there was a
particular haplotype associated with the LRRK2 mutation. STR markers were chosen to
span the PARK8 region, including D12S87, D12S1648, D12S2080, D12S2194, D12S1048,
D12S1301 and D12S1701. LRRK2 is located between D12S2194 and D12S1048. We also
developed seven novel STR markers in this region (shown in table 1 below) by searching
for repeat polymorphisms using RepeatMasker on in silico BAC sequence (UCSC Human
Genome Browser Web site). The labeling of these novel markers reflects their physical
position relative to the start codon of LRRK2.

Table 1. Novel chromosome 12 STR markers

Marker name    Primers (F, forward; R, reverse)         Physical position (bp)
                                                        on chromosome 12

[illegible]    F: 5'-TTGCAGCTGTAAGGAATTTGGG-3'          38873779
               R: 5'-firATTrTTrAGCCTQAQACCC-3' *

[illegible]    F: 5'-TGAAGGArACTGAACAAGATGG-3' *        38974140
               R: 5'-GCCATAGTCCTTCCATAGTTCC-3'

[illegible]    F: 5'-CGCAGCGAGCATTGTACC-3'              38989214
               R: 5'-CTCGGAAAGTTTCCCAATTC-3'

D12S2518       F: 5'-CTGGTATTACCTCAACTGTGGCTC-3'        39034800
               R: 5'-ACTGGTATGTTTAAGCCTGGCAC-3'

D12S2519       F: 5'-AGCAGCAGAGAAGATTTCAATAAC-3'        39116816
               R: 5'-AATCATCTTTGAAAGAACCAGG-3'

D12S2523       F: 5'-TAAACOAAGCTCCCTCACTGTAAG-3' *      39147728
               R: 5'-TCTTTGTAGCTGCGGTTGTTTC-3'

D12S2517       F: 5'-TCATGAAGATGTCTGTGATAGGGC-3'        39282976
               R: 5'-CTCTATTGTGAGCAAACTGCATGG-3'

* Sequence partly illegible in the source; reproduced as printed.

One primer of each pair was labeled with a fluorescent tag. PCR reactions were carried
out on 10-20 ng of DNA in a total volume of 15 µl with final reaction concentrations of 150
µM dNTP, 1x PCR-Buffer (Qiagen), 1x Q-Solution (Qiagen) and 0.6 µM of each primer,
with 1 unit of Taq polymerase (Qiagen). Amplification was performed using a 57-52°C
touchdown protocol over 38 cycles. The PCR product for each marker was diluted by a
factor of 10 to 100 with water. One microliter was then added to 10 µl of Hi-Di Formamide
and Rox size standard. All samples were run on an ABI 3100 genetic analyzer, and results
were analyzed using Genescan 3.7 and Genotyper 3.7 software (Applied Biosystems).
Since population allele frequencies were not available from the CEPH database, these have
been estimated by genotyping 95 unrelated Caucasian subjects, a population-based series
from the United States (shown in table 2 below).



Table 2. Allele frequencies of PARK8 markers

Marker and allele (bp)        Frequency (%)

D12S87 (n = 92)
  150                         0.5
  154                         [illegible]
  156                         27.2
  158                         33.2
  160                         11.4
  162                         2.7
  164                         6.0
  166                         17.4
  168                         0.5

D12S1648 (n = 91)
  110                         13.7
  112                         3.3
  114                         11.0
  116                         4.4
  118                         2.2
  120                         2.8
  122                         17.0
  124                         3.9
  [illegible]                 [illegible]
  [illegible]                 14.1
  130                         8.8
  132                         2.8
  134                         2.8
  136                         1.7
  138                         0.6
  140                         2.2
  142                         1.1

D12S2080 (n = 93)
  176                         1.6
  180                         20.2
  184                         44.7
  188                         22.9
  [illegible]                 [illegible]

[marker name illegible; probably D12S2194]
  [illegible]                 0.6
  [illegible]                 40.9
  253                         32.4
  257                         19.9
  261                         4.6
  265                         1.7

D12S2514 (n = 82)
  284                         11.0
  291                         53.1
  294                         32.3
  297                         1.2
  300                         2.4

[marker name illegible; probably D12S2515]
  [illegible]                 3.2
  [illegible]                 26.6
  216                         18.6
  [illegible]                 [illegible]
  [illegible]                 [illegible]
  [illegible]                 [illegible]
  [illegible]                 [illegible]

rs7966550 (n illegible)
  [illegible]                 [illegible]
  [illegible]                 [illegible]

D12S2516
  252                         37.3
  254                         62.7

rs1427263 (n = 89)
  A                           63.6
  C                           36.5

rs11176013 (n = 88)
  A                           49.4
  G                           50.6

rs11564148 (n = 88)
  A                           26.1
  T                           73.9

D12S2518 (n = 90)
  154                         79.7
  168                         15.9
  170                         4.4

D12S2519 (n = 72)
  [illegible]                 [illegible]
  134                         22.6
  138                         22.6
  140                         25.3

D12S2520 (n = 85)
  248                         8.2
  251                         7.6
  254                         10.0
  257                         54.1
  260                         20.0

D12S2521 (n = 93)
  311                         0.5
  315                         10.8
  319                         20.4
  323                         8.1
  327                         7.0
  331                         8.1
  335                         0.5
  355                         [illegible]
  359                         7.5
  363                         13.4
  367                         7.0
  371                         7.0
  375                         6.5
  379                         3.8
  383                         1.1
  387                         0.5

D12S2522 (n = 93)
  281                         [illegible]
  283                         14.0
  285                         0.5
  287                         11.3
  293                         0.5
  295                         15.6
  297                         44.6
  299                         4.3

D12S2523 (n = 89)
  305                         18.9
  314                         41.1
  317                         8.9
  320                         30.0
  323                         1.1

[marker name and n illegible]
  [illegible]                 8.5
  [illegible]                 7.5
  [illegible]                 [illegible]
  [illegible]                 11.7
  [illegible]                 8.0
  [illegible]                 5.3
  [illegible]                 1.1
  [illegible]                 1.1
  [illegible]                 3.2
  [illegible]                 0.5
  [illegible]                 3.7
  [illegible]                 6.9
  [illegible]                 6.9
  [illegible]                 4.3
  [illegible]                 [illegible]
  [illegible]                 [illegible]
  [illegible]                 [illegible]
  [illegible]                 [illegible]

D12S1048 (n = 89)
  211                         [illegible]
  214                         21.1
  [illegible]                 [illegible]
  [illegible]                 [illegible]
  [illegible]                 [illegible]
  [illegible]                 11.7
  229                         3.3

D12S1301 (n = 93)
  96                          0.5
  100                         37.2
  104                         17.6
  108                         11.1
  112                         12.2
  116                         13.3
  120                         7.5
  124                         0.5

D12S1701 (n = 93)
  89                          4.3
  91                          4.8
  93                          10.8
  95                          40.0
  97                          16.0
  99                          12.4
  101                         11.8
  103                         0.5

A The number of individuals genotyped is given for each marker (n)
B Allele frequencies are for individual markers in U.S. control subjects
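
The frequencies in table 2 were estimated by genotyping unrelated control subjects. A minimal sketch of such an estimate is given below, assuming simple allele counting in which each subject contributes two alleles; the function name and the example genotypes are hypothetical and are not data from the study.

```python
from collections import Counter

def allele_frequencies(genotypes):
    """Percent frequency of each allele from diploid genotypes.

    genotypes: list of (allele_1, allele_2) tuples, one per subject.
    Each subject contributes two alleles, so the denominator is 2 * n.
    """
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = 2 * len(genotypes)
    return {allele: 100.0 * c / total for allele, c in sorted(counts.items())}

# HYPOTHETICAL genotypes (allele sizes in bp) for five subjects.
example = [(252, 254), (254, 254), (252, 254), (254, 254), (252, 252)]
freqs = allele_frequencies(example)
print(freqs)
```

The number of subjects n reported for each marker corresponds to `len(genotypes)` in this sketch, which is why n differs between markers when some genotypes failed.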



Statistical Analysis

Multipoint nonparametric LOD scores for all families were calculated using
GENEHUNTER-PLUS (Kong and Cox 1997). The frequency of the deleterious allele was
set at 0.0001, and empirically determined allele frequencies were employed. The map
positions for each marker were taken from the Rutgers combined linkage-physical map version
1.0 (MAP-O-MAT web site). The three loci D12S2080, D12S2194 and D12S1301 are very
tightly linked, with no observed recombinants in the database or within our genotyped
families, and thus inter-marker distances were assigned as 0.01 cM.

Chromosome 12 haplotypes in the PARK8 region were established for those families in
which chromosome phase for mutation-carrying individuals could be deduced, thereby
determining which alleles co-segregated with the LRRK2 G2019S mutation in each family.
For those affected individuals in whom the associated allele for a marker could not be
determined, both alleles are given.

The age-dependent penetrance was estimated as the probability of a gene carrier
becoming affected, at a given age, within the 13 families. The number of affected mutation
carriers, for each decade, was divided by the total number of affected individuals, plus the
number of unaffected carriers within that range. For some affected family members no
DNA was available and only historical data on the disease course was obtained. These
individuals were excluded from penetrance calculations.
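
The penetrance calculation can be sketched as follows. This is one possible reading of the description above, not necessarily the exact procedure used: at each age threshold, carriers already affected are divided by all carriers whose status at that age is known. The function name and all ages in the example are hypothetical.

```python
def penetrance_by_age(onset_ages, unaffected_ages, thresholds):
    """Cumulative penetrance estimate at each age threshold.

    ASSUMED reading of the method. At threshold t:
      numerator   = carriers with onset at or before t
      denominator = numerator + carriers known to be unaffected at t,
                    i.e. unaffected carriers aged >= t plus carriers
                    whose onset came after t.
    Unaffected carriers younger than t are uninformative at t.
    """
    result = {}
    for t in thresholds:
        affected = sum(1 for a in onset_ages if a <= t)
        unaffected = (sum(1 for a in unaffected_ages if a >= t)
                      + sum(1 for a in onset_ages if a > t))
        denom = affected + unaffected
        result[t] = affected / denom if denom else None
    return result

# HYPOTHETICAL carriers: ages at onset, and current ages of unaffected carriers.
onset = [39, 48, 55, 58, 62, 66, 71, 78]
unaffected = [26, 45, 52, 60, 74]

estimate = penetrance_by_age(onset, unaffected, thresholds=[50, 60, 70])
for age, p in estimate.items():
    print(f"age {age}: {p:.2f}")
```

Other conventions for the denominator are possible, for example a life-table estimate, and would give somewhat different values from the same data.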

Results

As mentioned previously, we identified 13 affected probands (i.e. 13 patients) who
carry a heterozygous G6055A mutation in exon 41 of the LRRK2 gene. The mutation leads
to a G2019S amino acid substitution of a highly conserved residue within the predicted
activation loop of the MAPKKK (mitogen-activated protein kinase kinase kinase)
domain (figure 1). After genotyping a total of 42 additional family members, 22 additional
subjects were found to carry the mutation, seven with a diagnosis of PD (shown in table 3
below). One affected member of family P-089 did not carry the mutation and, for the
purposes of this study, was considered a phenocopy and excluded from further analyses.
Seven families originated from Norway, three were from the United States, two from
Ireland, and one was from Poland. One family from the United States descended from
Russia/Romania, and another from Italy. For only one family (family 111), the ethnic
origin was unknown. The LRRK2 G2019S mutation segregates with disease in all kindreds,
consistent with autosomal dominant transmission. To ensure patient confidentiality,
simplified versions of the family pedigrees are presented in figure 2. There was no evidence
of the mutation in the 2260 control samples.

Age at onset of clinical symptoms was quite variable, even within the same family. Family
1120, a family from the United States, had both the earliest and latest age at onset for a
patient. The youngest affected subject had an onset at 39 years, whereas the oldest carrier
presented with initial symptoms at 78 years. Where recorded, most LRRK2 G2019S carriers
have late-onset disease (>50 years at onset). The mean age at onset of affected mutation
carriers was 56.8 years (range 39-78 years, n=19). Unaffected carriers have a mean age of
53.9 years (range 26-74 years, n=14). The penetrance of the mutation was found to be
highly age-dependent, increasing from 17% at the age of 50 to 85% at the age of 70 (figure
4).

All affected members from the different families, except the individual in family P-089
who did not carry the mutation, appear to share a common haplotype on chromosome 12
at the LRRK2 gene locus (figure 3). Haplotypes can be established with certainty in nine of
the families, and all mutation carriers in these families share alleles for four STR markers
and four single nucleotide polymorphisms (SNPs) in the LRRK2 gene locus. These markers
are the STRs D12S2516, D12S2518, D12S2519 and D12S2520 and the SNPs rs7966550,
rs1427263, rs11176013 and rs11564148. For the remaining families, the number of available
samples from relatives was not sufficient to determine phase. However, the genotypes in
these cases are consistent with a common LRRK2 G2019S allele. D12S2516 is located in
intron 29 and D12S2518 is located in intron 44 of the LRRK2 gene, whereas the two other
shared STR markers are positioned 3' of the gene. Using the physical position of the shared and
non-shared markers, the size of the shared haplotype is between 145 kb and 154 kb.
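
The bounds on the shared haplotype follow from marker positions alone: the lower bound is the distance between the outermost shared markers, and the upper bound is the distance between the nearest non-shared markers on either side. The sketch below illustrates this with hypothetical marker names and positions; it does not reproduce the positions used in the study.

```python
def shared_haplotype_bounds(markers):
    """Lower and upper bound (bp) on the size of a shared haplotype.

    markers: list of (name, position_bp, shared) tuples, where `shared`
    is True if all carriers share the same allele at that marker.
    Lower bound: span of the outermost shared markers.
    Upper bound: span of the nearest non-shared flanking markers.
    """
    ordered = sorted(markers, key=lambda m: m[1])
    shared_idx = [i for i, m in enumerate(ordered) if m[2]]
    if not shared_idx:
        raise ValueError("no shared markers")
    first, last = shared_idx[0], shared_idx[-1]
    if not all(ordered[i][2] for i in range(first, last + 1)):
        raise ValueError("shared markers are not contiguous")
    if first == 0 or last == len(ordered) - 1:
        raise ValueError("need a non-shared flanking marker on each side")
    lower = ordered[last][1] - ordered[first][1]
    upper = ordered[last + 1][1] - ordered[first - 1][1]
    return lower, upper

# HYPOTHETICAL markers and positions, for illustration only.
example_markers = [
    ("M1", 100_000, False),
    ("M2", 110_000, True),
    ("M3", 180_000, True),
    ("M4", 230_000, True),
    ("M5", 250_000, False),
]
bounds = shared_haplotype_bounds(example_markers)
print(bounds)
```

The true breakpoints lie somewhere between each outermost shared marker and its non-shared neighbor, which is why the result is a range rather than a single size.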

Discussion 

We have identified a novel LRRK2 mutation, G2019S, which co-segregates with
autosomal dominant parkinsonism in 13 kindreds originating from several European
populations. Positive LOD scores were obtained in multiplex families, and combined they
provide significant support for the PARK8 locus. The LRRK2 G2019S mutation was absent in a
large number of control subjects of similar ethnicity. The families linked to
LRRK2 in this and previous studies now account for the majority of genetically defined
autosomal dominant parkinsonism.

The mean age at onset of affected LRRK2 G2019S carriers was 56.8 years,
comparable to that of patients in other families linked to PARK8 (Funayama et al. 2002;
Paisan-Ruiz et al. 2004; Zimprich et al. 2004a). The majority of patients present with
late-onset disease, indistinguishable from typical idiopathic PD. Disease penetrance is
age-dependent, and increases in a linear fashion from 17% at the age of 50 to 85% at the age of
70. Age is the single most consistent risk factor for development of PD and other
neurodegenerative disorders (Lang and Lozano 1998), and an important risk factor in
LRRK2-associated parkinsonism. Interestingly, age at onset was variable in this study, both
within and between different families, suggesting that other susceptibility factors,
environmental or genetic, may influence the phenotype.

Although our findings clearly indicate that LRRK2 mutations account for a substantial
proportion of familial late-onset parkinsonism, historically, cross-sectional twin studies
have not supported a genetic etiology for late-onset PD (Tanner et al. 1999; Wirdefeldt et
al. 2004). The age-associated penetrance of LRRK2 mutations provides some explanation,
as even large and well-designed twin studies are underpowered to detect incompletely
penetrant mutations (Simon et al. 2002). LRRK2 mutations were also found in apparently
sporadic PD patients; three of the patients in this study did not have any known affected
first- or second-degree relatives. However, a caveat of age-dependent penetrance is that
carriers may die of other diseases before manifesting or being diagnosed with PD. Thus, it
seems difficult to separate sporadic and familial PD, or to hypothesize environmental
causes to be more important in one group and genetic causes more prominent in the other.
In light of these results, a family history of parkinsonism, previously considered an
exclusion criterion for a diagnosis of PD, must be reconsidered (Hughes et al. 1992).

LRRK2 is a member of the recently defined ROCO protein family (Bosgraaf and Van
Haastert 2003). In human, mouse and rat, members of the ROCO protein family have five
conserved domains (figure 1). The kinase domain belongs to the MAPKKK subfamily of
kinases. The active sites of all kinases are located in a cleft between an N-terminal and a
C-terminal lobe, typically covered by an 'activation loop' in an inactive conformation. The
activation loop must undergo crucial structural changes to allow access to peptide
substrates and to orientate key catalytic amino acids (Huse and Kuriyan 2002). In different
kinases, the activation loop starts and ends with the conserved residues asp-phe-gly (DFG)
and ala-pro-glu (APE), respectively (Dibb et al. 2004). Of note, the LRRK2 G2019S
substitution changes a highly conserved amino acid at the start of this loop (figure 5). In a
German family we previously described, an I2020T mutation is located in an adjacent
codon (Zimprich et al. 2004a). In other kinases, oncogenic mutations in residues within the
activation loop of the kinase domain have an activating effect (Davies et al. 2002); thus we
postulate that the LRRK2 G2019S and I2020T mutations may have an effect on its kinase activity.
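
A segment bounded by these two motifs can be located in a protein sequence with a simple pattern search. The sketch below is illustrative only: the function name is hypothetical, the toy sequence is invented and is not a real kinase, and the motifs are parameters because individual kinases may deviate from the DFG and APE consensus.

```python
import re

def find_activation_loops(protein_seq, start_motif="DFG", end_motif="APE"):
    """Return (start, end, segment) for each candidate activation loop.

    A candidate runs from a start motif to the first end motif that
    follows it. Positions are 1-based and inclusive. The sequence is
    expected in one-letter code, upper case.
    """
    pattern = re.compile(
        re.escape(start_motif) + r"[A-Z]*?" + re.escape(end_motif)
    )
    return [(m.start() + 1, m.end(), m.group())
            for m in pattern.finditer(protein_seq)]

# HYPOTHETICAL toy sequence, not a real kinase.
toy = "MKKLHRDLKAANILVDFGLARLMTPEYVAPEVLRGK"
loops = find_activation_loops(toy)
print(loops)
```

A residue number such as 2019 can then be compared with the reported start position to see whether a substitution falls on the first motif of the loop.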

The age of an allele may be estimated from the genetic variation among different copies
(intra-allelic variation), or from its frequency (Slatkin and Rannala 2000). However, the
local recombination rate on chromosome 12q12 is unknown, as is the frequency of the
G2019S mutation in the general population. Nevertheless, at centromeres there is generally
a dearth of recombination; indeed no crossovers have been observed between LRRK2
flanking markers D12S2194 and D12S1048 in our studies, or within CEPH families
(MAP-O-MAT web site). The physical size of the shared haplotype is also small, between 145 kb
and 154 kb, and the allele is widespread in families from several European populations.
Hence, the mutation is likely to be ancient and may be relatively common in specific
populations. These data suggest a substantial proportion of late-onset PD will have a
genetic basis.

Electronic-Database Information

The physical position of markers is from NCBI build 34. Accession numbers and URLs for
data presented herein are as follows:

Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim/
MAP-O-MAT, http://compgen.rutgers.edu/mapomat
RepeatMasker, http://www.repeatmasker.org/

Patent claims

1. A polynucleotide consisting of the base sequence of SEQ ID NO: 2, or a complementary
strand thereto, wherein the X is one of the group being defined by the bases A, C or T.

2. A polynucleotide according to claim 1, wherein the polynucleotide is at least a part of
a gene.

3. A recombinant vector comprising a polynucleotide according to claim 1.

4. A DNA probe specific for the polynucleotide of claim 1, wherein it contains more than
10 consecutive nucleotides from the polynucleotide, or the complementary strand.

5. A method of proving parkinsonism inheritance, by screening a sample of material taken
from the subject of interest, with a probe according to claim 4.

6. A DNA primer specific for the polynucleotide of claim 1, wherein it contains more than 10
consecutive nucleotides from the polynucleotide, or the complementary strand.

7. Use of a polynucleotide according to claim 1, or a vector according to claim 3, to
transfect an organism.

8. Use according to claim 7, wherein the organism is a mammal.

9. A peptide consisting of the amino acid sequence of SEQ ID NO: 1, wherein the X is not glycine.

Abstract:

A polynucleotide consisting of the base sequence of SEQ ID NO: 2, or a complementary
strand thereto, wherein the X is one of the group being defined by the bases A, C or T. A
primer and a probe specific for that polynucleotide, wherein the primer and/or probe
contains at least 10 consecutive nucleotides, and finally use of the probe for proving
parkinsonism inheritance.

SEQUENCE LISTING

<110> Aasly, Jan O
      Wszolek, Zbigniew K
      Farrer, Matthew J

<120> Polynucleotide

<130> U1913

<150> NO20045612
<151> 2004-12-23

<160> 2

<170> PatentIn version 3.1

<210> 1
<211> 2527
<212> PRT
<213> Homo sapiens

<220>
<221> MISC_FEATURE
<222> (2019)..(2019)
<223> X may be any amino acid except glycine.

<300>
<308> Genbank / AY792511
<309> 2004-11-15
<313> (1)..(2527)

<400> 1 

Met Ala Ser Gly Ser Cys Gln Gly Cys Glu Glu Asp Glu Glu Thr Leu
1               5                   10                  15

Lys Lys Leu Ile Val Arg Leu Asn Asn Val Gln Glu Gly Lys Gln Ile
            20                  25                  30

Glu Thr Leu Val Gln Ile Leu Glu Asp Leu Leu Val Phe Thr Tyr Ser
        35                  40                  45

Glu His Ala Ser Lys Leu Phe Gln Gly Lys Asn Ile His Val Pro Leu
    50                  55                  60

Leu Ile Val Leu Asp Ser Tyr Met Arg Val Ala Ser Val Gln Gln Val
65                  70                  75                  80

Gly Trp Ser Leu Leu Cys Lys Leu Ile Glu Val Cys Pro Gly Thr Met
                85                  90                  95

Gln Ser Leu Met Gly Pro Gln Asp Val Gly Asn Asp Trp Glu Val Leu
            100                 105                 110

Gly Val His Gln Leu Ile Leu Lys Met Leu Thr Val His Asn Ala Ser
        115                 120                 125

Val Asn Leu Ser Val Ile Gly Leu Lys Thr Leu Asp Leu Leu Leu Thr
    130                 135                 140

Ser Gly Lys Ile Thr Leu Leu Ile Leu Asp Glu Glu Ser Asp Ile Phe
145                 150                 155                 160

Met Leu Ile Phe Asp Ala Met His Ser Phe Pro Ala Asn Asp Glu Val
                165                 170                 175

Gln Lys Leu Gly Cys Lys Ala Leu His Val Leu Phe Glu Arg Val Ser
            180                 185                 190

Glu Glu Gln Leu Thr Glu Phe Val Glu Asn Lys Asp Tyr Met Ile Leu
        195                 200                 205

Leu Ser Ala Ser Thr Asn Phe Lys Asp Glu Glu Glu Ile Val Leu His
    210                 215                 220

Val Leu His Cys Leu His Ser Leu Ala Ile Pro Cys Asn Asn Val Glu
225                 230                 235                 240

Val Leu Met Ser Gly Asn Val Arg Cys Tyr Asn Ile Val Val Glu Ala
                245                 250                 255

Met Lys Ala Phe Pro Met Ser Glu Arg Ile Gln Glu Val Ser Cys Cys
            260                 265                 270

Leu Leu His Arg Leu Thr Leu Gly Asn Phe Phe Asn Ile Leu Val Leu
        275                 280                 285

Asn Glu Val His Glu Phe Val Val Lys Ala Val Gln Gln Tyr Pro Glu
    290                 295                 300

Asn Ala Ala Leu Gln Ile Ser Ala Leu Ser Cys Leu Ala Leu Leu Thr
305                 310                 315                 320

Glu Thr Ile Phe Leu Asn Gln Asp Leu Glu Glu Lys Asn Glu Asn Gln
                325                 330                 335

Glu Asn Asp Asp Glu Gly Glu Glu Asp Lys Leu Phe Trp Leu Glu Ala
            340                 345                 350

Cys Tyr Lys Ala Leu Thr Trp His Arg Lys Asn Lys His Val Gln Glu
        355                 360                 365

Ala Ala Cys Trp Ala Leu Asn Asn Leu Leu Met Tyr Gln Asn Ser Leu
    370                 375                 380

His Glu Lys Ile Gly Asp Glu Asp Gly His Phe Pro Ala His Arg Glu
385                 390                 395                 400



Val Met Leu Ser Met Leu Met His Ser Ser Ser Lys Glu Val Phe Gln
                405                 410                 415

Ala Ser Ala Asn Ala Leu Ser Thr Leu Leu Glu Gln Asn Val Asn Phe
            420                 425                 430

Arg Lys Ile Leu Leu Ser Lys Gly Ile His Leu Asn Val Leu Glu Leu
        435                 440                 445

Met Gln Lys His Ile His Ser Pro Glu Val Ala Glu Ser Gly Cys Lys
    450                 455                 460

Met Leu Asn His Leu Phe Glu Gly Ser Asn Thr Ser Leu Asp Ile Met
465                 470                 475                 480

Ala Ala Val Val Pro Lys Ile Leu Thr Val Met Lys Arg His Glu Thr
                485                 490                 495

Ser Leu Pro Val Gln Leu Glu Ala Leu Arg Ala Ile Leu His Phe Ile
            500                 505                 510

Val Pro Gly Met Pro Glu Glu Ser Arg Glu Asp Thr Glu Phe His His
        515                 520                 525

Lys Leu Asn Met Val Lys Lys Gln Cys Phe Lys Asn Asp Ile His Lys
    530                 535                 540

Leu Val Leu Ala Ala Leu Asn Arg Phe Ile Gly Asn Pro Gly Ile Gln
545                 550                 555                 560

Lys Cys Gly Leu Lys Val Ile Ser Ser Ile Val His Phe Pro Asp Ala
                565                 570                 575

Leu Glu Met Leu Ser Leu Glu Gly Ala Met Asp Ser Val Leu His Thr
            580                 585                 590

Leu Gln Met Tyr Pro Asp Asp Gln Glu Ile Gln Cys Leu Gly Leu Ser
        595                 600                 605

Leu Ile Gly Tyr Leu Ile Thr Lys Lys Asn Val Phe Ile Gly Thr Gly
    610                 615                 620

His Leu Leu Ala Lys Ile Leu Val Ser Ser Leu Tyr Arg Phe Lys Asp
625                 630                 635                 640

Val Ala Glu Ile Gln Thr Lys Gly Phe Gln Thr Ile Leu Ala Ile Leu
                645                 650                 655

Lys Leu Ser Ala Ser Phe Ser Lys Leu Leu Val His His Ser Phe Asp
            660                 665                 670

Leu Val Ile Phe His Gln Met Ser Ser Asn Ile Met Glu Gln Lys Asp
        675                 680                 685

Gln Gln Phe Leu Asn Leu Cys Cys Lys Cys Phe Ala Lys Val Ala Met
    690                 695                 700

Asp Asp Tyr Leu Lys Asn Val Met Leu Glu Arg Ala Cys Asp Gln Asn
705                 710                 715                 720

Asn Ser Ile Met Val Glu Cys Leu Leu Leu Leu Gly Ala Asp Ala Asn
                725                 730                 735

Gln Ala Lys Glu Gly Ser Ser Leu Ile Cys Gln Val Cys Glu Lys Glu
            740                 745                 750

Ser Ser Pro Lys Leu Val Glu Leu Leu Leu Asn Ser Gly Ser Arg Glu
        755                 760                 765

Gln Asp Val Arg Lys Ala Leu Thr Ile Ser Ile Gly Lys Gly Asp Ser
    770                 775                 780

Gln Ile Ile Ser Leu Leu Leu Arg Arg Leu Ala Leu Asp Val Ala Asn
785                 790                 795                 800



Asn Ser Ile Cys Leu Gly Gly Phe Cys Ile Gly Lys Val Glu Pro Ser
                805                 810                 815

Trp Leu Gly Pro Leu Phe Pro Asp Lys Thr Ser Asn Leu Arg Lys Gln
            820                 825                 830

Thr Asn Ile Ala Ser Thr Leu Ala Arg Met Val Ile Arg Tyr Gln Met
        835                 840                 845

Lys Ser Ala Val Glu Glu Gly Thr Ala Ser Gly Ser Asp Gly Asn Phe
    850                 855                 860

Ser Glu Asp Val Leu Ser Lys Phe Asp Glu Trp Thr Phe Ile Pro Asp
865                 870                 875                 880

Ser Ser Met Asp Ser Val Phe Ala Gln Ser Asp Asp Leu Asp Ser Glu
                885                 890                 895

Gly Ser Glu Gly Ser Phe Leu Val Lys Lys Lys Ser Asn Ser Ile Ser
            900                 905                 910

Val Gly Glu Phe Tyr Arg Asp Ala Val Leu Gln Arg Cys Ser Pro Asn
        915                 920                 925

Leu Gln Arg His Ser Asn Ser Leu Gly Pro Ile Phe Asp His Glu Asp
    930                 935                 940

Leu Leu Lys Arg Lys Arg Lys Ile Leu Ser Ser Asp Asp Ser Leu Arg
945                 950                 955                 960

Ser Ser Lys Leu Gln Ser His Met Arg His Ser Asp Ser Ile Ser Ser
                965                 970                 975

Leu Ala Ser Glu Arg Glu Tyr Ile Thr Ser Leu Asp Leu Ser Ala Asn
            980                 985                 990

Glu Leu Arg Asp Ile Asp Ala Leu Ser Gln Lys Cys Cys Ile Ser Val
        995                 1000                1005

His Leu Glu His Leu Glu Lys Leu Glu Leu His Gin Asn Ala Leu 
1010 1015 1020 

Thr Ser Phe Pro Gin Gin Leu .Cys Glu Thr Leu Lys Ser Leu Thr 
1025 1030 1035 

His Leu Asp Leu His Ser Asn Lys Phe Thr Ser Phe Pro Ser Tyr 
1040 1045 1050 

Leu Leu Lys Met Ser Cys He Ala Asn Leu Asp Val Ser Arg Asn 
1055 1060 1065 



Asp Ile Gly Pro Ser Val Val Leu Asp Pro Thr Val Lys Cys Pro
1070 1075 1080

Thr Leu Lys Gln Phe Asn Leu Ser Tyr Asn Gln Leu Ser Phe Val
1085 1090 1095

Pro Glu Asn Leu Thr Asp Val Val Glu Lys Leu Glu Gln Leu Ile
1100 1105 1110

Leu Glu Gly Asn Lys Ile Ser Gly Ile Cys Ser Pro Leu Arg Leu
1115 1120 1125

Lys Glu Leu Lys Ile Leu Asn Leu Ser Lys Asn His Ile Ser Ser
1130 1135 1140

Leu Ser Glu Asn Phe Leu Glu Ala Cys Pro Lys Val Glu Ser Phe
1145 1150 1155

Ser Ala Arg Met Asn Phe Leu Ala Ala Met Pro Phe Leu Pro Pro
1160 1165 1170

Ser Met Thr Ile Leu Lys Leu Ser Gln Asn Lys Phe Ser Cys Ile
1175 1180 1185

Pro Glu Ala Ile Leu Asn Leu Pro His Leu Arg Ser Leu Asp Met
1190 1195 1200

Ser Ser Asn Asp Ile Gln Tyr Leu Pro Gly Pro Ala His Trp Lys
1205 1210 1215

Ser Leu Asn Leu Arg Glu Leu Leu Phe Ser His Asn Gln Ile Ser
1220 1225 1230

Ile Leu Asp Leu Ser Glu Lys Ala Tyr Leu Trp Ser Arg Val Glu
1235 1240 1245

Lys Leu His Leu Ser His Asn Lys Leu Lys Glu Ile Pro Pro Glu
1250 1255 1260

Ile Gly Cys Leu Glu Asn Leu Thr Ser Leu Asp Val Ser Tyr Asn
1265 1270 1275



Leu Glu Leu Arg Ser Phe Pro Asn Glu Met Gly Lys Leu Ser Lys 
1280 1285 1290 



Ile Trp Asp Leu Pro Leu Asp Glu Leu His Leu Asn Phe Asp Phe
1295 1300 1305

Lys His Ile Gly Cys Lys Ala Lys Asp Ile Ile Arg Phe Leu Gln
1310 1315 1320

Gln Arg Leu Lys Lys Ala Val Pro Tyr Asn Arg Met Lys Leu Met
1325 1330 1335

Ile Val Gly Asn Thr Gly Ser Gly Lys Thr Thr Leu Leu Gln Gln
1340 1345 1350

Leu Met Lys Thr Lys Lys Ser Asp Leu Gly Met Gln Ser Ala Thr
1355 1360 1365

Val Gly Ile Asp Val Lys Asp Trp Pro Ile Gln Ile Arg Asp Lys
1370 1375 1380

Arg Lys Arg Asp Leu Val Leu Asn Val Trp Asp Phe Ala Gly Arg
1385 1390 1395

Glu Glu Phe Tyr Ser Thr His Pro His Phe Met Thr Gln Arg Ala
1400 1405 1410

Leu Tyr Leu Ala Val Tyr Asp Leu Ser Lys Gly Gln Ala Glu Val
1415 1420 1425

Asp Ala Met Lys Pro Trp Leu Phe Asn Ile Lys Ala Arg Ala Ser
1430 1435 1440

Ser Ser Pro Val Ile Leu Val Gly Thr His Leu Asp Val Ser Asp
1445 1450 1455

Glu Lys Gln Arg Lys Ala Cys Met Ser Lys Ile Thr Lys Glu Leu
1460 1465 1470

Leu Asn Lys Arg Gly Phe Pro Ala Ile Arg Asp Tyr His Phe Val
1475 1480 1485

Asn Ala Thr Glu Glu Ser Asp Ala Leu Ala Lys Leu Arg Lys Thr
1490 1495 1500

Ile Ile Asn Glu Ser Leu Asn Phe Lys Ile Arg Asp Gln Leu Val
1505 1510 1515

Val Gly Gln Leu Ile Pro Asp Cys Tyr Val Glu Leu Glu Lys Ile
1520 1525 1530

Ile Leu Ser Glu Arg Lys Asn Val Pro Ile Glu Phe Pro Val Ile
1535 1540 1545

Asp Arg Lys Arg Leu Leu Gln Leu Val Arg Glu Asn Gln Leu Gln
1550 1555 1560

Leu Asp Glu Asn Glu Leu Pro His Ala Val His Phe Leu Asn Glu 
1565 1570 1575 

Ser Gly Val Leu Leu His Phe Gln Asp Pro Ala Leu Gln Leu Ser
1580 1585 1590

Asp Leu Tyr Phe Val Glu Pro Lys Trp Leu Cys Lys Ile Met Ala
1595 1600 1605

Gln Ile Leu Thr Val Lys Val Glu Gly Cys Pro Lys His Pro Lys
1610 1615 1620

Gly Ile Ile Ser Arg Arg Asp Val Glu Lys Phe Leu Ser Lys Lys
1625 1630 1635

Arg Lys Phe Pro Lys Asn Tyr Met Ser Gln Tyr Phe Lys Leu Leu
1640 1645 1650

Glu Lys Phe Gln Ile Ala Leu Pro Ile Gly Glu Glu Tyr Leu Leu
1655 1660 1665

Val Pro Ser Ser Leu Ser Asp His Arg Pro Val Ile Glu Leu Pro
1670 1675 1680

His Cys Glu Asn Ser Glu Ile Ile Ile Arg Leu Tyr Glu Met Pro
1685 1690 1695

Tyr Phe Pro Met Gly Phe Trp Ser Arg Leu Ile Asn Arg Leu Leu
1700 1705 1710

Glu Ile Ser Pro Tyr Met Leu Ser Gly Arg Glu Arg Ala Leu Arg
1715 1720 1725

Pro Asn Arg Met Tyr Trp Arg Gln Gly Ile Tyr Leu Asn Trp Ser
1730 1735 1740

Pro Glu Ala Tyr Cys Leu Val Gly Ser Glu Val Leu Asp Asn His
1745 1750 1755

Pro Glu Ser Phe Leu Lys Ile Thr Val Pro Ser Cys Arg Lys Gly
1760 1765 1770

Cys Ile Leu Leu Gly Gln Val Val Asp His Ile Asp Ser Leu Met
1775 1780 1785

Glu Glu Trp Phe Pro Gly Leu Leu Glu Ile Asp Ile Cys Gly Glu
1790 1795 1800



Gly Glu Thr Leu Leu Lys Lys Trp Ala Leu Tyr Ser Phe Asn Asp 
1805 1810 1815 



Gly Glu Glu His Gln Lys Ile Leu Leu Asp Asp Leu Met Lys Lys
1820 1825 1830

Ala Glu Glu Gly Asp Leu Leu Val Asn Pro Asp Gln Pro Arg Leu
1835 1840 1845

Thr Ile Pro Ile Ser Gln Ile Ala Pro Asp Leu Ile Leu Ala Asp
1850 1855 1860

Leu Pro Arg Asn Ile Met Leu Asn Asn Asp Glu Leu Glu Phe Glu
1865 1870 1875

Gln Ala Pro Glu Phe Leu Leu Gly Asp Gly Ser Phe Gly Ser Val
1880 1885 1890

Tyr Arg Ala Ala Tyr Glu Gly Glu Glu Val Ala Val Lys Ile Phe
1895 1900 1905

Asn Lys His Thr Ser Leu Arg Leu Leu Arg Gln Glu Leu Val Val
1910 1915 1920

Leu Cys His Leu His His Pro Ser Leu Ile Ser Leu Leu Ala Ala
1925 1930 1935

Gly Ile Arg Pro Arg Met Leu Val Met Glu Leu Ala Ser Lys Gly
1940 1945 1950

Ser Leu Asp Arg Leu Leu Gln Gln Asp Lys Ala Ser Leu Thr Arg
1955 1960 1965

Thr Leu Gln His Arg Ile Ala Leu His Val Ala Asp Gly Leu Arg
1970 1975 1980

Tyr Leu His Ser Ala Met Ile Ile Tyr Arg Asp Leu Lys Pro His
1985 1990 1995



Asn Val Leu Leu Phe Thr Leu Tyr Pro Asn Ala Ala Ile Ile Ala
2000 2005 2010

Lys Ile Ala Asp Tyr Xaa Ile Ala Gln Tyr Cys Cys Arg Met Gly
2015 2020 2025

Ile Lys Thr Ser Glu Gly Thr Pro Gly Phe Arg Ala Pro Glu Val
2030 2035 2040

Ala Arg Gly Asn Val Ile Tyr Asn Gln Gln Ala Asp Val Tyr Ser
2045 2050 2055

Phe Gly Leu Leu Leu Tyr Asp Ile Leu Thr Thr Gly Gly Arg Ile
2060 2065 2070

Val Glu Gly Leu Lys Phe Pro Asn Glu Phe Asp Glu Leu Glu Ile
2075 2080 2085

Gln Gly Lys Leu Pro Asp Pro Val Lys Glu Tyr Gly Cys Ala Pro
2090 2095 2100

Trp Pro Met Val Glu Lys Leu Ile Lys Gln Cys Leu Lys Glu Asn
2105 2110 2115

Pro Gln Glu Arg Pro Thr Ser Ala Gln Val Phe Asp Ile Leu Asn
2120 2125 2130

Ser Ala Glu Leu Val Cys Leu Thr Arg Arg Ile Leu Leu Pro Lys
2135 2140 2145

Asn Val Ile Val Glu Cys Met Val Ala Thr His His Asn Ser Arg
2150 2155 2160

Asn Ala Ser Ile Trp Leu Gly Cys Gly His Thr Asp Arg Gly Gln
2165 2170 2175

Leu Ser Phe Leu Asp Leu Asn Thr Glu Gly Tyr Thr Ser Glu Glu
2180 2185 2190

Val Ala Asp Ser Arg Ile Leu Cys Leu Ala Leu Val His Leu Pro
2195 2200 2205

Val Glu Lys Glu Ser Trp Ile Val Ser Gly Thr Gln Ser Gly Thr
2210 2215 2220

Leu Leu Val Ile Asn Thr Glu Asp Gly Lys Lys Arg His Thr Leu
2225 2230 2235



Glu Lys Met Thr Asp Ser Val Thr Cys Leu Tyr Cys Asn Ser Phe 
2240 2245 2250 

Ser Lys Gln Ser Lys Gln Lys Asn Phe Leu Leu Val Gly Thr Ala
2255 2260 2265

Asp Gly Lys Leu Ala Ile Phe Glu Asp Lys Thr Val Lys Leu Lys
2270 2275 2280

Gly Ala Ala Pro Leu Lys Ile Leu Asn Ile Gly Asn Val Ser Thr
2285 2290 2295

Pro Leu Met Cys Leu Ser Glu Ser Thr Asn Ser Thr Glu Arg Asn
2300 2305 2310

Val Met Trp Gly Gly Cys Gly Thr Lys Ile Phe Ser Phe Ser Asn
2315 2320 2325

Asp Phe Thr Ile Gln Lys Leu Ile Glu Thr Arg Thr Ser Gln Leu
2330 2335 2340

Phe Ser Tyr Ala Ala Phe Ser Asp Ser Asn Ile Ile Thr Val Val
2345 2350 2355

Val Asp Thr Ala Leu Tyr Ile Ala Lys Gln Asn Ser Pro Val Val
2360 2365 2370

Glu Val Trp Asp Lys Lys Thr Glu Lys Leu Cys Gly Leu Ile Asp
2375 2380 2385



Cys Val His Phe Leu Arg Glu Val Met Val Lys Glu Asn Lys Glu 
2390 2395 2400 



Ser Lys His Lys Met Ser Tyr Ser Gly Arg Val Lys Thr Leu Cys 
2405 2410 2415 



Leu Gln Lys Asn Thr Ala Leu Trp Ile Gly Thr Gly Gly Gly His
2420 2425 2430

Ile Leu Leu Leu Asp Leu Ser Thr Arg Arg Leu Ile Arg Val Ile
2435 2440 2445

Tyr Asn Phe Cys Asn Ser Val Arg Val Met Met Thr Ala Gln Leu
2450 2455 2460

Gly Ser Leu Lys Asn Val Met Leu Val Leu Gly Tyr Asn Arg Lys
2465 2470 2475

Asn Thr Glu Gly Thr Gln Lys Gln Lys Glu Ile Gln Ser Cys Leu
2480 2485 2490

Thr Val Trp Asp Ile Asn Leu Pro His Glu Val Gln Asn Leu Glu
2495 2500 2505

Lys His Ile Glu Val Arg Lys Glu Leu Ala Glu Lys Met Arg Arg
2510 2515 2520

Thr Ser Val Glu
2525

<400> 2

atggctagtg gcagctgtca ggggtgcgaa gaggacgagg aaactctgaa gaagttgata 60 

gtcaggctga acaatgtcca ggaaggaaaa cagatagaaa cgctggtcca aatcctggag 120 

gatctgctgg tgttcacgta ctccgagcnc gcctccaagt tatttcaagg caaaaatatc 180 

catgtgcctc tgttgatcgt cttggactcc tatatgagag tcgcgagtgt gcagcaggtg 240 

ggttggtcac ttctgtgcaa attaatagaa gtctgtccag gtacaatgca aagcttaatg 300 

ggaccccagg atgttggaaa tgattgggaa gtccttggtg ttcaccaatt gattcttaaa 360 

atgctaacag ttcataatgc cagtgtaaac ttgtcagtga ttggactgaa gaccttagat 420 

ctcctcctaa cttcaggtaa aatcaccttg ctgatactgg atgaagaaag tgatattttc 480 

atgttaattt ttgatgccat gcactcattt ccagccaatg atgaagtcca gaaacttgga 540

tgcaaagctt tacatgtgct gtttgagaga gtctcagagg agcaactgac tgaatttgtt 600 

gagaacaaag attatatgat attgttaagt gcgtcaacaa attttaaaga tgaagaggaa 660 

attgtgcttc atgtgctgca ttgtttacat tccctagcga ttccttgcaa taatgtggaa 720 

gtcctcatga gtggcaatgt caggtgttat aatattgtgg tggaagctat gaaagcattc 780 

cctatgagtg aaagaattca agaagtgagt tgctgtttgc tccataggct tacattaggt 840 

aattttttca atatcctggt attaaacgaa gtccatgagt ttgtggtgaa agctgtgcag 900 

cagtacccag agaatgcagc attgcagatc tcagcgctca gctgtttggc cctcctcact 960 

gagactattt tcttaaatca agatttagag gaaaagaatg agaatcaaga gaatgatgat 1020 

gagggggaag aagataaatt gttttggctg gaagcctgtt acaaagcatt aacgtggcat 1080 

agaaagaaca agcacgtgca ggaggccgca tgctgggcac taaataatct ccttatgtac 1140 

caaaacagtt tacatgagaa gattggagat gaagatggcc atttcccagc tcatagggaa 1200 

gtgatgctct ccatgctgat gcattcttca tcaaaggaag ttttccaggc atctgcgaat 1260 

gcattgtcaa ctctcttaga acaaaatgtt aatttcagaa aaatactgtt atcaaaagga 1320 

atacacctga atgttttgga gttaatgcag aagcatatag attctcctga agtggctgaa 1380

agtggctgta aaatgctaaa tcatcttttt gaaggaagca acacttccct ggatataatg 1440 

gcagcagtgg tccccaaaat actaacagtt atgaaacgtc atgagacatc attaccagtg 1500 

cagctggagg cgcttcgagc tattttacat tttatagtgc ctggcatgcc agaagaatcc 1560 



agggaggata cagaatttca tcataagcta aatatggtta aaaaacagtg tttcaagaat 1620

gatattcaca aactggtcct agcagctttg aacaggttca ttggaaatcc tgggattcag 1680 

aaatgtggat taaaagtaat ttcttctatt gtacattttc ctgatgcatt agagatgtta 1740

tccctggaag gtgctatgga ttcagtgctt cacacactgc agatgtatcc agatgaccaa 1800 

gaaattcagt gtctgggttt aagtcttata ggatacttga ttacaaagaa gaatgtgttc 1860

ataggaactg gacatctgct ggcaaaaatt ctggtttcca gcttataccg atttaaggat 1920 

gttgctgaaa tacagactaa aggatttcag acaatcttag caatcctcaa attgtcagca 1980 

tctttttcta agctgctggt gcatcattca tttgacttag taatattcca tcaaatgtct 2040 

tccaatatca tggaacaaaa ggatcaacag tttctaaacc tctgttgcaa gtgttttgca 2100 

aaagtagcta tggatgatta cttaaaaaat gtgatgctag agagagcgtg tgatcagaat 2160 

aacagcatca tggttgaatg cttgcttcta ttgggagcag atgccaatca agcaaaggag 2220 

ggatcttctt taatttgtca ggtatgtgag aaagagagca gtcccaaatt ggtggaactc 2280 

ttactgaata gtggatctcg tgaacaagat gtacgaaaag cgttgacgat aagcattggg 2340 

aaaggtgaca gccagatcat cagcttgctc ttaaggaggc tggccctgga tgtggccaac 2400 

aatagcattt gccttggagg attttgtata ggaaaagttg aaccttcttg gcttggtcct 2460 

ttatttccag ataagacttc taatttaagg aaacaaacaa atatagcatc tacactagca 2520 
agaatggtga tcagatatca gatgaaaagt gctgtggaag aaggaacagc ctcaggcagc 2580 
gatggaaatt tttctgaaga tgtgctgtct aaatttgatg aatggacctt tattcctgac 2640 
tcttctatgg acagtgtgtt tgctcaaagt gatgacctgg atagtgaagg aagtgaaggc 2700 
tcatttcttg tgaaaaagaa atctaattca attagtgtag gagaatttta ccgagatgcc 2760
gtattacagc gttgctcacc aaatttgcaa agacattcca attccttggg gcccattttt 2820
gatcatgaag atttactgaa gcgaaaaaga aaaatactat cttcagatga ttcactcagg 2880
tcatcaaaac ttcaatccca tatgaggcat tcagacagca tttcttctct ggcttctgag 2940
agagaatata ttacatcact agacctttca gcaaatgaac taagagatat tgatgcccta 3000
agccagaaat gctgtataag tgttcatttg gagcatcttg aaaagctgga gcttcaccag 3060
aatgcactca cgagctttcc acaacagcta tgtgaaactc tgaagagttt gacacatttg 3120
gacttgcaca gtaataaatt tacatcattt ccttcttatt tgttgaaaat gagttgtatt 3180
gctaatcttg atgtctctcg aaatgacatt ggaccctcag tggttttaga tcctacagtg 3240
aaatgtccaa ctctgaaaca gtttaacctg tcatataacc agctgtcttt tgtacctgag 3300
aacctcactg atgtggtaga gaaactggag cagctcattt tagaaggaaa taaaatatca 3360
gggntatgct cccccttgag actgaaggaa ctgaagattt taaaccttag taagaaccac 3420
atttcatccc tatcagagaa ctttcttgag gcttgtccta aagtggagag tttcagtgcc 3480
agaatgaatt ttcttgctgc tatgcctttc ttgcctcctt ctatgacaat cctaaaatta 3540
tctcagaaca aattttcctg tattccagaa gcaattttaa atcttccaca cttgcggtct 3600
ttagatatga gcagcaatga tattcagtac ctaccaggtc ccgcacactg gaaatctttg 3660
aacttaaggg aactcttatt tagccataat cagatcagca tcttggactt gagtgaaaaa 3720
gcatatttat ggtctagagt agagaaactg catctttctc acaataaact gaaagagatt 3780
cctcctgaga ttggctgtct tgaaaatctg acatctctgg atgtcagtta caacttggaa 3840
ctaagatcct ttcccaatga aatggggaaa ttaagcaaaa tatgggatct tcctttggat 3900
gaactgcatc ttaactttga ttttaaacat ataggatgta aagccaaaga catcataagg 3960
tttcttcaac agcgattaaa aaaggctgtg ccttataacc gaatgaaact tatgattgtg 4020
ggaaatactg ggagtggtaa aaccacctta ttgcagcaat taatgaaaac caagaaatca 4080
gatcttggaa tgcaaagtgc cacagttggc atagatgtga aagactggcc tatccaaata 4140
agagacaaaa gaaagagaga tctcgtccta aatgtgtggg attttgcagg tcgtgaggaa 4200
ttctatagta ctcatcccca ttttatgacg cagcgagcat tgtaccttgc tgtctatgac 4260
ctcagcaagg gacaggctga agttgatgcc atgaagcctt ggctcttcaa tataaaggct 4320
ngcgcttctt cttcccctgt gattctcgtt ggcacacatt tggatgtttc tgatgagaag 4380
caacgcaaag cctgcatgag taaaatcacc aaggaactcc tgaataagcg agggttccct 4440
gccatacgag attaccactt tgtgaatgcc accgaggaat ctgatgcttt ggcaaaactt 4500
cggaaaacca tcataaacga gagccttaat ttcaagatcc gagatcagct tgttgttgga 4560
cagctgattc cagactgcta tgtagaactt gaaaaaatca ttttatcgga gcgtaaaaat 4620
gtgccaattg aatttcccgt aattgaccgg aaacgattat tacaactagt gagagaaaat 4680
cagctgcagt tagatgaaaa tgagcttcct cacgcagttc actttctaaa tgaatcagga 4740
gtccttcttc attttcaaga cccagcactg cagttaagtg acttgtactt tgtggaaccc 4800
aagtggcttt gtaaaatcat ggcacagatt ttgacagtga aagtggaagg ttgtccaaaa 4860
caccctaagg gcattatttc gcgtagagat gtggaaaaat ttctttcaaa aaaaaggaaa 4920
tttccaaaga actacatgtc acagtatttt aagctcctag aaaaattcca gattgctttg 4980
ccaataggag aagaatattt gctggttcca agcagtttgt ctgaccacag gcctgtgata 5040
gagcttcccc attgtgagaa ctctgaaatt atcatccgac tatatgaaat gccttntttt 5100
ccaatgggat tttggtcaag attaatcaat cgattacttg agatttcacc ttacatgctt 5160
tcagggagag aacgagcact tcgcccaaac agaatgtatt ggcgacaagg catttactta 5220
aattggtctc ctgaagctta ttgtctggta ggatctgaag tcttagacaa tcatccagag 5280

agtttcttaa aaattacagt tccttcttgt agaaaaggct gtattctttt gggccaagtt 5340 

gtggaccaca ttgattctct catggaagaa tggtttcctg ggttgctgga gattgatatt 5400 

tgtggtgaag gagaaactct gttgaagaaa tgggcattat atagttttaa tgatggngaa 5460

gaacatcaaa aaatcttact tgatgacttg atgaagaaag cagaggaagg agatctctta 5520 

gtaaatccag atcaaccaag gctcaccatt ccaatatctc agattgcccc tgacttgatt 5580 

ttggctgacc tgcctagaaa tattatgttg aataatgatg agttggaatt tgaacaagct 5640 

ccagagtttc tcctaggtga tggcagtttt ggatcagttt accgagcagc ctatgaagga 5700 

gaagaagtgg ctgtgaagat ttttaataaa catacatcac tcaggctgtt aagacaagag 5760

cttgtggtgc tttgccacct ccaccacccc agtttgatat ctttgctggc agctgggatt 5820 

cgtccccgga tgttggtgat ggagttagcc tccaagggtt ccttggatcg cctgcttcag 5880 

caggacaaag ccagcctcac tagaacccta cagcacagga ttgcactcca cgtagctgat 5940 

ggtttgagat acctccactc agccatgatt atataccgag acctgaaacc ccacaatgtg 6000 

ctgcttttca cactgtatcc caatgctgcc atcattgcaa agattgctga ctacngcatt 6060 

gctcagtact gctgtagaat ggggataaaa acatcagagg gcacaccagg gtttcgtgca 6120 

cctgaagttg ccagaggaaa tgtcatttat aaccaacagg ctgatgttta ttcatttggt 6180 

ttactactct atgacatttt gacaactgga ggtagaatag tagagggttt gaagtttcca 6240 

aatgagtttg atgaattaga aatacaagga aaattacctg atccagttaa agaatatggt 6300 

tgtgccccat ggcctatggt tgagaaatta attaaacagt gtttgaaaga aaatcctcaa 6360 

gaaaggccta cttctgccca ggtctttgac attttgaatt cagctgaatt agtctgtctg 6420 

acgagacgca ttttattacc taaaaacgta attgttgaat gcatggttgc tacacatcac 6480 

aacagcagga atgcaagcat ttggctgggc tgtgggcaca ccgacagagg acagctctca 6540 

tttcttgact taaatactga aggatacact tctgaggaag ttgctgatag tagaatattg 6600 

tgcttagcct tggtgcatct tcctgttgaa aaggaaagct ggattgtgtc tgggacacag 6660 
tctggtactc tcctggtcat caataccgaa gatgggaaaa agagacatac cctagaaaag 6720 
atgactgatt ctgtcacttg tttgtattgc aattcctttt ccaagcaaag caaacaaaaa 6780 
aattttcttt tggttggaac cgctgatggc aagttagcaa tttttgaaga taagactgtt 6840 
aagcttaaag gagctgctcc tttgaagata ctaaatatag gaaatgtcag tactccattg 6900 
atgtgtttga gtgaatccac aaattcaacg gaaagaaatg taatgtgggg aggatgtggc 6960 
acaaagattt tctccttttc taatgatttc accattcaga aactcattga gacaagaaca 7020 



agccaactgt tttcttatgc agctttcagt gattccaaca tcataacagt ggtggtagac 7080
actgctctct atattgctaa gcaaaatagc cctgttgtgg aagtgtggga taagaaaact 7140
gaaaaactct gtggactaat agactgcgtg cactttttaa gggaggtaat ggtaaaagaa 7200
aacaaggaat caaaacacaa aatgtcttat tctgggagag tgaaaaccct ctgccttcag 7260
aagaacactg ctctttggat aggaactgga ggaggccata ttttactcct ggatctttca 7320
actcgtcgac ttatacgtgt aatttacaac ttttgtaatt cggtcagagt catgatgaca 7380
gcacagctag gaagccttaa aaatgtcatg ctggtattgg gctacaaccg gaaaaatact 7440
gaaggtacac aaaagcagaa agagatacaa tcttgcttga ccgtttggga catcaatctt 7500
ccacatgaag tgcaaaattt agaaaaacac attgaagtga gaaaagaatt agctgaaaaa 7560
atgagacgaa catctgttga gtaa 7584